DESCRIPTION 
TRAP VECTORS AND GENE TRAPPING USING THE SAME 
TECHNICAL FIELD 

The present invention relates to random mutation ES clone technology using gene 
trapping. 

BACKGROUND ART 

It is said that structural analysis of the human genome will be completed in or before 2003,
as the human genome project is progressing well. The age of isolating genes one by
one and analyzing their structures separately seems to be over, and we have entered the
age of "structural analysis" of the genome.

With the nucleotide sequence of genome alone, however, information on functions is 
insufficient. Thus, a novel analysis system for functional analysis is needed. Further, 
although one of the major goals of human genome analysis is to elucidate causative genes in 
human diseases, such diseases cannot be explained with the structures of causative genes 
alone. 

Accordingly, production of model individuals is an indispensable assignment in order 
to analyze processes of disease development and to develop new treatment methods after the 
identification of causative genes. 

On the other hand, if genome is divided into gene regions and non-gene regions in 
terms of structure, it is considered that these two parts have separate functions and that it is 
necessary to analyze the functions of both parts (Fig. 1). From the viewpoint of entire 
genome, each gene is performing only a part of the entire function. Genome is not a mere 
collection of genes and may have unknown functions. In fact, a new concept "position 
effect mutation" has been established. From this, it is presumed that genome has regions of 
unknown functions. 

Gene regions are composed of regulatory regions and coding regions. At present, the 
target of genome functional analysis is coding regions. When the mouse is compared with
human, the kinds of genes they have are almost the same. Therefore, functional analysis of the
regulatory region is important. There are species differences between mouse and human,
and it is believed that these differences are due not to differences in proteins but to
differences in the regulation of gene expression.

The function of a transcription factor or the like involved in the regulation of gene 
expression can be elucidated from the sequence of the coding region of the relevant gene. 
The analysis of the functions of those elements to which the transcription factor binds is 
extremely difficult at present because a number of those elements exist in the regulatory 
region of one gene. However, as a technique of functional analysis, a method using 
bacterial artificial chromosomes may be considered. 

It is considered that functional analysis of coding regions may be performed at the 
mRNA level, protein level, cell level, tissue/organ level and individual (i.e. whole animal) 
level. It is believed that such analysis at the mRNA level can be performed using DNA 
chips. On the other hand, the use of embryonic stem (ES) cells seems to be the best way 
for performing functional analysis at other levels, because various cell and tissue derivative 
systems have been developed directly from ES cells in vitro and a number of such systems 
are expected to be developed in the future. Furthermore, the use of ES cells is 
advantageous in that individual level analysis systems can be established. 

From the foregoing, it is understood that gene knockout at ES cell level and 
production of knockout mice in which the relevant genes are knocked out are extremely 
important in functional analysis of genome. 

To date, homologous recombination using ES cells has played a major role in the 
production of knockout mice. However, considering this method not as a strategy of 
producing knockout mice separately but from a strategic viewpoint of producing knockout 
mice comprehensively, this method has serious problems. 

First, this method requires too much time. In the production of knockout mice, it is 
the rate-determining step to isolate knockout ES clones generated through homologous 
recombination using ES cells. Even a skillful researcher needs at least three months for 
isolating a knockout ES clone. Thus, only four genes can be knocked out in one year.
Accordingly, in the case of introducing each one mutation into 10^ genes, 2,500 researchers 
are required for one year. It is estimated that approximately 1,000 lines of knockout mice
are produced in one year in the world. This means that it would take 100 years to produce
10^5 knockout ES clones. This is quite unrealistic compared with the pace of the structural
analysis of the human genome, which is to be completed in 2003.

Secondly, this method costs too much. At least 2 to 4 million yen is
necessary to produce one line of knockout mouse, excluding personnel expenses and
depreciation expenses. Thus, production of 10^5 simple knockout mice requires 200 to 400
billion yen.

As described above, the conventional homologous recombination using ES cells has 
problems, and genome is vast. However, the number of genes in genome is limited. Thus, 
it is necessary to isolate from genome those genes having important functions. In many 
cases, the function of a gene is elucidated only after production of a knockout mouse in 
which the relevant gene is disrupted. Therefore, knockout mice are directly connected with 
future development of epoch-making drugs and have extremely high added value. Under
these circumstances, it has become the world's "strategy" to produce mutant mice at random
and on a large scale. At present, the three methods described below are considered most
reasonable in the production of random mutation mice.

The first one is a method using ethylnitrosourea (ENU), a mutagen. A project of 
large-scale mutant production using ENU has been started in Europe. In Germany, Dr.
Balling of the Institute of Mammalian Genetics and others started this project in 1997 as a
part of the human genome project. In England, supported by SmithKline Beecham, Dr.
Brown and others started this project at MRC Mouse Genome Center in Harwell aiming at
establishment of mutant mice having mutations mainly in the brain/nervous system. To date,
these two groups have established approximately 200 lines of mutant mice exhibiting
dominant inheritance. The project is proceeding more efficiently than expected. In the
United States, it has been decided that structural analysis of the mouse genome and production
of mutants by the ENU method will start with a huge budget (6 billion yen/year) at Case Western
Reserve University, Oak Ridge National Laboratory, etc.

When ENU is administered to adult male mice, ENU acts on spermatogonia before
meiosis and causes about 50 to 100 point mutations per spermatogonium at random.
Mutations occur at a frequency of 1/1,000/gamete per locus. Therefore, by crossing one
treated male mouse with one normal female mouse, many kinds of mutant mice can be
produced in the F1 generation. In the method using ENU, if 1,000 mice are screened for a
specific locus, one mouse is expected, in terms of probability, to carry a mutation in that
locus. Thus, this method is considered highly efficient.
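
The screening estimate above can be illustrated with a short calculation. This is only a sketch: it takes the stated frequency of 1/1,000 per locus per gamete at face value and assumes, as a simplification not stated in the text, that each screened mouse is an independent trial.

```python
# Figures taken from the text; independence between mice is an assumption.
p = 1 / 1000   # mutation frequency per locus per gamete
n = 1000       # number of F1 mice screened for one locus

expected_mutants = n * p                # mean number of mutants at the locus
p_at_least_one = 1 - (1 - p) ** n       # chance that the screen finds any mutant

print(f"expected mutants at the locus: {expected_mutants:.1f}")
print(f"P(at least one mutant):        {p_at_least_one:.3f}")
```

Under these assumptions the expected number of mutants is one, while the chance of finding at least one mutant in a single screen of 1,000 mice is somewhat lower than certainty.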

The second method is a method using chlorambucil, which is also a mutagen. This
method causes mutations in spermatogonia at the same frequency as the method using
ENU. However, these mutations are deletion mutations, and sometimes as much as one
megabase may be deleted.

The third method is a method using gene trapping. Gene trapping is a technique that 
was developed for the purpose of searching for unknown genes by introducing trap vectors 
containing a marker gene into ES cells and then monitoring the expression of the marker 
gene. Trap vectors are integrated into ES cells at random and, as a result of their 
integration, endogenous genes (genes present in cells and tissues inherently) are disrupted in 
most cases. Therefore, preparing chimeric mice from such ES cells can produce various 
knockout mice. 

However, each of the methods using a mutagen and the method using gene trapping
has advantages and drawbacks (Table 1).



Table 1

                                  ENU Method        Chlorambucil Method   Gene Trapping
---------------------------------------------------------------------------------------------
Nature of Mutation                Point mutation    Deletion mutation     Any desired mutation
Production of Mutant Mouse        Easy              Easy                  Difficult
Identification of Mutant Mouse    Difficult         Medium                Easy
Other Features                                                            Can use ES trap clones

According to the ENU method, production of mutant mice is easy, but establishment
of individual mutant lines is not easy because segregation by crossing should be conducted. 
Further, in order to identify mutated genes, the relevant locus should be identified first by 
linkage analysis using polymorphic DNA markers, and then the gene should be isolated by 
positional cloning. Thus, the ENU method requires complicated operations. 

According to the chlorambucil method, production of mutant mice is easy, but deleted 
sites should be identified. For that purpose, analysis must be made using a number of 
polymorphic DNA markers. Besides, generally, methods using a mutagen such as
chlorambucil need large breeding rooms. Thus, such methods require much expense and
labor.

Although the gene trap method requires labor and technology for producing mutant 
mice, identification of mutated genes is easy and experiments can be conducted according to 
the size of breeding rooms. Gene trap ES clones per se are a precious resource for functional
analysis of genome. The gene trap method is also remarkably different from other methods 
in this point. 

Some laboratories in the world have already started production of mutants by gene 
trapping. In the United States, a private firm Lexicon Genetics Incorporated is undertaking 
random disruption by gene trapping using retrovirus vectors. However, ordinary 
researchers can hardly use this service because of the following reasons. Briefly, it is not 
sure whether an endogenous gene is disrupted or not even if the gene is trapped; it is not 
clear whether germline chimeric mice can be produced; an additional charge is required for 
the production of chimeric mice; and considerable charges are required for using the service. 
In Germany, gene trapping is performed toward a goal of 12,000 clones as a part of the ENU 
project. Anyway, these are proceeding focusing on the analysis of trapped genes rather 
than the establishment of mouse lines. 

DISCLOSURE OF THE INVENTION 

The problem for solution by the invention is to overcome the problems that 
conventional gene trap methods have, to develop a novel "exchangeable gene trap method" 
that seems almost ideal, to establish ES trap clones on a large scale using the above method,
and to produce mouse mutants using the trap clones. Thus, it is an object of the invention 
to provide trap vectors; a method of gene trapping; transgenic or knockout animals in which 
a trapped gene is introduced; and trapped genes. 

As a result of intensive and extensive research toward the solution of the above
problems, the present inventors have reached an idea of using the bacteriophage-derived
recombination system Cre-loxP in gene trapping. Thus, the present invention has been
achieved. Cre is a recombinase that recognizes a loxP sequence and causes recombination
at that site.

The present patent application provides the following inventions: 

(1) A trap vector containing a loxP sequence composed of inverted repeat sequence 1, a 
spacer sequence and inverted repeat sequence 2 in this order, the loxP sequence being a 
mutant loxP wherein a part of inverted repeat sequence 1 or 2 is mutated. 

As a specific example of the mutant loxP in which a part of its inverted repeat
sequence 1 is mutated, lox71 (e.g. the sequence shown in SEQ ID NO: 1) may be given.
As a specific example of the mutant loxP in which a part of its inverted repeat sequence 2 is
mutated, lox66 (e.g. the sequence shown in SEQ ID NO: 2) may be given.

(2) A vector generated from recombination between a trap vector containing a mutant loxP
wherein a part of inverted repeat sequence 1 is mutated and a trap vector containing a
mutant loxP wherein a part of inverted repeat sequence 2 is mutated.

(3) A trap vector selected from the group consisting of the following (a) to (i): 



(a) SP-SA-lox71-IRES-M-loxP-PV-SP

(b) SP-lox71-IRES-M-loxP-PV-SP

(c) SA-lox71-IRES-M-loxP-pA-PV-SP

(d) SA-lox71-IRES-M-loxP-puro-pA-PV-SP

(e) lox71-M-loxP-pA-lox2272-PV-lox511

(f) lox71-IRES-M-loxP-pA-lox2272-PV-lox511

(g) (lox71-integrated SA)-M-loxP-pA-lox2272-PV-lox511

(h) (lox71-integrated SA)-IRES-M-loxP-pA-lox2272-PV-lox511

(i) (lox71-integrated SA)-M-loxP-pA-lox2272-promoter-M-lox511-SD,
wherein SP represents any sequence; SA represents a splice acceptor; SD represents a splice
donor; IRES represents an internal ribosomal entry site; M represents a marker gene; puro
represents puromycin resistance gene; pA represents a poly(A) sequence; and PV represents
a plasmid vector. 

In the trap vectors (a) to (i) described above, β-geo gene may be given as a specific
example of the marker gene, and pBR322, pUC plasmids (pUC18, pUC19, pUC118,
pUC119, etc.), pSP plasmids (pSP64, pSP65, pSP73, etc.) and pGEM plasmids (pGEM-3,
pGEM-4, pGEM-3Z, etc.) may be enumerated as specific examples of the plasmid vector.

(4) A method of gene trapping comprising introducing any of the above-described trap 
vectors into embryonic stem cells, and embryonic stem cells into which the trap vector is 
introduced by the method. 

(5) A method for producing a transgenic animal or knockout animal comprising 
introducing the above-described embryonic stem cells into an animal, and a transgenic 
animal or knockout animal produced by the method. 

As a specific example of the above animal, one selected from the group consisting of 
mouse, rat, rabbit, guinea pig, pig, sheep and goat may be given. 

Hereinbelow, the present invention will be described in detail. The present 
specification encompasses the contents of the specification and/or drawings of the Japanese 
Patent Application No. 11-200997 based on which the present application claims priority. 

The present invention relates to a method of gene trapping, transgenic or knockout 
animals into which a trapped gene is introduced, and trapped genes. An outline of the 
method of gene trapping according to the invention is shown in Fig. 2. First, in order to 
achieve the object of the present invention, a trap vector is constructed and introduced into 
ES cells, followed by isolation and selection of trap clones (Fig. 2A-C). In Fig. 2, the
pU-Hachi vector is exemplified. Subsequently, chimeric animals (e.g. chimeric mice) are
produced, followed by production of mutant mice derived from the trap clone (Fig. 2F-G). 
On the other hand, using the trapped and selected clone, isolation and sequencing of the 
trapped gene as well as recovery of the genome by plasmid rescue are performed (Fig. 2C-E). 




Further, the clone is subjected to electroporation and selection with a drug such as 
puromycin to thereby trap a gene of interest. Then, the trapped gene is expressed, followed 
by production of ES clone-derived mouse lines (Fig. 2H-I). 

The present invention can be summarized as follows (including pilot studies). 
(1) Overall Efficiency

(1-1) Screening by Formation of Embryoid Bodies

One hundred and six neomycin resistant clones were suspension-cultured for the
formation of embryoid bodies. The expression of β-galactosidase was analyzed at the
stage of ES cells and after the induction of differentiation. As a result, it was found that
90 trap clones (86%) were expressing β-galactosidase at either of the above stages.
(1-2) Selection of Clones Indicating Single Copy Integration 

DNA was extracted from 109 trap clones that had expressed the marker gene during 
the process of embryoid body formation, and then integration patterns of the trap vector were 
analyzed. As a result, 75 clones (70%) had a single copy integrated. Of these, 24 clones 
(22%) were complete (i.e. retained the replication origin of the plasmid) and 40 clones (37%) 
lacked pUC. Even if pUC was lost, it could be re-inserted by using lox71 site. Therefore, 
these 64 clones (59%) were found to be useful (Table 4). 
(1-3) Efficiency of Germline Chimera Production

Chimeric mice were produced using the above-mentioned trap clones. As a result, 
germline chimeric mice were obtained from approximately one half of the clones. 
(1-4) Summary of the Entire Experiment 

It was found that about 26% of the neomycin resistant clones selected initially reached 
the final stage of the experiment. Since the efficiency of germline chimera production is 
now increasing, it is believed that the overall efficiency can be increased further. However,
the efficiency achieved at this time seems to be sufficient for practical research.
(2) Efficiency of the Gene Trap Method 

As a result of the tests so far conducted, 24 trap lines were established. Of these, 13
lines have proceeded to analysis at the gene level. Nucleotide sequences of these lines were
compared with the GenBank and EMBL databases using the BLAST program. The results were as
follows: 9 clones individually trapped a known gene; 3 clones individually trapped an EST; 
and the remaining 1 clone trapped an unknown gene (Table 2). According to the reports so 
far made by other researchers, 10-25% of trapped genes are known genes; 10-20% are ESTs; 
50-80% are unknown genes; and 2-10% are repeats. 



Table 2

                     Known Gene    EST          Unknown Gene    Repeat
------------------------------------------------------------------------
Present Invention    9 (69.2%)     3 (23.1%)    1 (7.7%)
Previous Reports     10-25%        10-20%       50-80%          2-10%
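
As a check on the percentages reported for the present invention in Table 2, the following sketch recomputes them from the clone counts given in the text (9, 3 and 1 out of the 13 lines analyzed). The dictionary keys are illustrative labels only.

```python
# Clone counts from the text: 13 lines analyzed at the gene level.
counts = {"known gene": 9, "EST": 3, "unknown gene": 1}
total = sum(counts.values())

# Share of each class, as a percentage rounded to one decimal place.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name:>12}: {counts[name]:2d} / {total} = {share}%")
```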



(3) Trapped Genes 

The inventors examined whether those genes involved in development and cell 
growth had been efficiently trapped or not by ascertaining the kinds of known genes by a 
screening method utilizing formation of embryoid bodies. As a result, it was found that the 
known genes were CBP (CREB binding protein) and Sp1 that are transcription factors;
cyclin B2 involved in cell cycle; Crk and pHPS1-2 involved in signal transduction; rRNA,
sui1, hnRNP L and RNA polymerase I; and mitochondrial DNA (Table 3). Thus, it was
found that very common genes were trapped. A major part of these genes are involved in 
cell growth. This suggests that the screening system utilizing formation of embryoid 
bodies works well. 

Table 3

Class                         Clone No.    Gene
------------------------------------------------------------------------------------------
1. Nucleus
   (1) Transcription          Ayu3-112     CBP
                              Ayu8-038     Sp1
   (2) Cell Cycle             AyuJ-UUo     Cyclin B2
                              Ayu6-003     Homologous to the E. coli cell division protein Ftsj1
   (3) Signal Transduction    Ayu8-104     Crk
                              Ayu8-025     pHPS1-2
   (4) Cell Skeleton                       Dynamin II
2. Cytoplasm
   (1) Translation            Ayu3-022     rRNA
                              Ayu8-016     sui1
                              Ayu8-016     Upstream region of hnRNP L
                              Ayu8-019     Very likely to be RNA polymerase I
   (2) Others                 Ayu3-001     Mitochondrial DNA
3. Unknown                    Ayu7-003     Unknown



(4) Confirmation of Gene Disruption by Trapping 

One of the major questions is whether endogenous genes have actually been disrupted
by gene trapping. Thus, the inventors have analyzed the structure of the trap site for
6 known genes. As a result, it was found that the trap vector was inserted into the promoter
region in one gene; into an exon in one gene; and into an intron in 4 genes. In all of them,
the gene was completely or partially disrupted. Therefore, it has become clear that
endogenous genes can be disrupted efficiently by the method of gene trapping of the
invention (Fig. 10).



Hereinbelow, the present invention will be described in more detail. 
1. Construction of Trap Vectors

Gene trapping is a method for trapping unknown genes on genome utilizing the fact 
that trap vectors introduced into ES cells are integrated into mouse endogenous genes 
incidentally and at random. "Gene trapping" means that a trap vector enters into a specific 
gene on genome and captures that gene. The vector for gene capturing is called "trap 
vector". Genes have enhancers, promoters, exons, poly(A) sequences, etc. The trap
vector is capable of capturing any of them. For this purpose, a trap vector with a structure
suitable for the specific capturing may be used.

Generally, exon trap vectors are composed of a reporter gene with a splice acceptor 
alone, a drug selection marker gene and a plasmid. Only when these vectors are integrated 
downstream of a mouse endogenous gene, the reporter gene is expressed. This means that 
it is possible to know the vector's integration into an endogenous gene by monitoring the 
expression of the reporter gene in the trap vector. If a plasmid such as pUC19 has been 
linked to the trap vector, the trapped endogenous gene can be isolated by the technique 
called plasmid rescue. "Plasmid rescue" is a technique for recovering a gene of interest by 
selection with ampicillin, etc. of those cells transformed with electroporation or the like (Fig. 
2E). Furthermore, since the endogenous gene is disrupted at the time of trapping, knockout 
mice can be produced immediately. Further, since the reporter gene is expressed under the 
control of the expression regulatory region of the endogenous gene, the tissue specificity and 
time specificity of the gene can be analyzed easily. 

In conventional gene trapping methods, even if a mouse endogenous gene could be 
disrupted completely, it has been impossible to introduce thereinto subtle mutations, such as 
single amino acid substitution, seen in human hereditary diseases. Also, it has been 
impossible to replace the disrupted mouse gene with a human gene. Toward the solution of
these problems, the present invention has modified the Cre-loxP system (a bacteriophage-
derived recombination system) and utilized it in the trap vector in gene trapping. As a
result, it has become possible to insert any gene into a mutant loxP site of the trap vector 
after a mouse gene has been disrupted as a result of the integration of the trap vector. 
According to the present invention, it has become possible to introduce subtle mutations, 
such as single amino acid substitution, seen in human hereditary diseases. It has also 
become possible to replace the trapped gene with a human gene. The trap vector of the 
invention may be used for trapping various genes. In particular, it may be used preferably
for exon trapping or promoter trapping. 

loxP (locus of crossing (X-ing) over, P1) is a 34 bp sequence (5'-ataacttcgtata gcatacat
tatacgaagttat-3') (SEQ ID NO: 3). The 13 bases at its 5' end (called "inverted repeat
sequence 1") and the 13 bases at its 3' end (called "inverted repeat sequence 2") constitute
inverted repeat sequences, which are separated by an 8 bp spacer (gcatacat) (Fig. 3). The
term "inverted repeat sequences" used herein means that a sequence located on one side of
the spacer is complementary to a sequence located on the other side of the spacer in opposite
orientation. In other words, the sense strand of one sequence is homologous to the
antisense strand of the other sequence in opposite orientation to each other. These two
repeat sequences have opposite orientation and, when a double-strand is formed, one same
sequence is repeated. Thus, they are called inverted repeat sequences. As shown in Fig. 3,
in one strand (for example, the sense strand) of the double-strand, inverted repeat sequence 1
(5'-ataacttcgtata-3'; SEQ ID NO: 4) (the left side in Fig. 3) is complementary, in the 5'→3'
direction, to inverted repeat sequence 2 (5'-tatacgaagttat-3'; SEQ ID NO: 5) (the right side in
Fig. 3) in the 3'→5' direction.
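
The inverted-repeat relationship described above can be checked with a short sketch. It is given for illustration only and is not part of the disclosed method; the helper name reverse_complement is hypothetical. It splits the 34 bp loxP sequence (SEQ ID NO: 3) into the two 13-base repeats and the 8 bp spacer, and tests whether the reverse complement of inverted repeat sequence 1 equals inverted repeat sequence 2.

```python
# 34 bp loxP as given in the text: repeat 1, spacer, repeat 2.
LOXP = "ataacttcgtata" + "gcatacat" + "tatacgaagttat"

def reverse_complement(seq):
    """Return the complementary strand read 5'->3' (hypothetical helper)."""
    pairs = {"a": "t", "t": "a", "g": "c", "c": "g"}
    return "".join(pairs[base] for base in reversed(seq))

repeat1, spacer, repeat2 = LOXP[:13], LOXP[13:21], LOXP[21:]

# The two 13-base arms are inverted repeats of each other.
arms_are_inverted_repeats = reverse_complement(repeat1) == repeat2
# The spacer is not its own reverse complement, which is what gives the
# site an orientation.
spacer_is_symmetric = reverse_complement(spacer) == spacer

print(len(LOXP), arms_are_inverted_repeats, spacer_is_symmetric)
```

The asymmetric spacer is consistent with the statement in the next paragraph that loxP has directionality.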

Unlike ordinary sequences, loxP has directionality. Therefore, when the loxP
sequence is represented in the above-mentioned 5'→3' direction in the present invention,
an arrow pointing to the left will be included in the representation.

Cre (causes recombination) means a recombinase that causes genetic recombination 
and, upon recognition of the above-described repeats, cleaves the spacer in such a manner 
that "cataca" in the spacer is left as a cohesive end (Fig. 3). 

In bacteria, recombination occurs between two loxP sites, and an insertion or deletion
reaction takes place. If it is possible to cause the insertion reaction in mammalian cells, then
any desired gene can be inserted later. This would dramatically expand the applicability of
gene trapping. In practice, however, since mammalian cells have large nuclei, circular DNA
molecules carrying loxP that have once been excised diffuse away, and the insertion reaction
is hardly observed.

Toward the solution of the above problems, the present inventors have elaborated a
method in which mutations are introduced into the loxP sequence in order to cause the insertion
reaction and, once a gene has been inserted into the genome, the gene does not undergo deletion
(i.e. is not removed from the genome). For this method, the inventors have prepared two
mutant loxP sequences (Fig. 4).

Briefly, the inventors created one mutant by introducing substitution mutations into
one of the inverted repeat sequences of loxP (sense strand) (ATAACTTCGTATA; shown at
the left in Fig. 4b) so that the mutated sequence becomes TACCGTTCGTATA (the first five
bases, TACCG, are the changed portion). This mutant is designated "lox71" (SEQ ID NO: 1;
Figure 4b). The other mutant was created by introducing substitution mutations into the
other inverted repeat sequence of loxP (sense strand) (TATACGAAGTTAT; shown at the right in
Fig. 4b) so that the mutated sequence becomes TATACGAACGGTA (the last five bases, CGGTA,
are the changed portion). This mutant is designated "lox66" (SEQ ID NO: 2; Figure 4b).

When recombination has occurred between lox71 on genome and lox66 on a plasmid,
a loxP sequence having mutations in both repeats (designated "lox71/66";
TACCGTTCGTATA GCATACAT TATACGAACGGTA; SEQ ID NO: 6) is located on the 5' side
of the inserted DNA (Fig. 4a, see at the left) and a wild-type loxP sequence
(ATAACTTCGTATA GCATACAT TATACGAAGTTAT; SEQ ID NO: 3) on the 3' side of the
inserted DNA (Fig. 4a, see at the left). As a result, Cre no longer can recognize lox71/66
and thus cannot cause recombination with loxP. In the case of homologous recombination
between two wild-type loxP sequences, a circular DNA containing the excised loxP is
physically separated. Thus, the reaction tends toward deletion rather than insertion. On
the other hand, when lox71 is used on chromosomes and lox66 is used on circular DNA
molecules, Cre has difficulty in recognizing the lox71/66 generated as a result of integration
of the DNA. Thus, the reaction tends toward insertion rather than deletion, and the inserted
state is maintained (Fig. 5). It should be noted that, in the present invention,
lox66 may be used on chromosomes, and lox71 may be used on circular DNA molecules.
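
The arm exchange described above can be sketched as follows. This is a simplified illustration, not the inventors' procedure: each site is modelled as a (left arm, spacer, right arm) tuple, and recombination is assumed to exchange the arms within the shared spacer. The function names site and recombine are hypothetical.

```python
# Arm and spacer sequences as given in the text (sense strand).
LEFT_WT, LEFT_MUT = "ATAACTTCGTATA", "TACCGTTCGTATA"
RIGHT_WT, RIGHT_MUT = "TATACGAAGTTAT", "TATACGAACGGTA"
SPACER = "GCATACAT"

def site(left, right):
    """Model a lox site as (left arm, spacer, right arm)."""
    return (left, SPACER, right)

lox71 = site(LEFT_MUT, RIGHT_WT)   # mutated inverted repeat sequence 1
lox66 = site(LEFT_WT, RIGHT_MUT)   # mutated inverted repeat sequence 2
loxP = site(LEFT_WT, RIGHT_WT)     # wild type

def recombine(genomic, plasmid):
    """Assumed model: exchange within the spacer, so the 5' product keeps the
    genomic left arm and gains the plasmid right arm, and vice versa."""
    five_prime = (genomic[0], SPACER, plasmid[2])
    three_prime = (plasmid[0], SPACER, genomic[2])
    return five_prime, three_prime

five_prime_site, three_prime_site = recombine(lox71, lox66)
print(" ".join(five_prime_site))    # site on the 5' side of the inserted DNA
print(" ".join(three_prime_site))   # site on the 3' side of the inserted DNA
```

In this model the 5' product carries mutations in both arms (lox71/66) and the 3' product is wild-type loxP, which matches the arrangement described in the text.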

Actually, when a mutant loxP (hereinafter, sometimes referred to as "mutant lox")
such as lox71 has been integrated into ES cells in advance, and a plasmid containing another
mutant loxP (e.g. lox66) is introduced thereinto, the plasmid is integrated into the genome.
Therefore, if this lox71, for example, has been integrated into a gene trap vector in advance,
it becomes possible to insert any desired gene later by using lox66. Thus, according to the
present invention, it has become possible to replace the trapped gene with a gene into which
a subtle mutation(s) has (have) been introduced or a human gene.

Gene trap vectors using this mutant lox (lox71 or lox66) may be constructed as
described below (see Fig. 6). Here, it should be noted that the following trap vectors are
provided only for illustration, not for limitation. Thus, although lox71 is used as an
example of a mutant lox in the following vectors, vectors using lox66 instead of lox71 are
also included in the present invention.

(a) U8: SP-SA-lox71-IRES-M-pA-loxP-PV-SP

(b) U8delta: SP-lox71-IRES-M-pA-loxP-PV-SP

(c) pU-Hachi: SA-lox71-IRES-M-loxP-pA-PV-SP

(d) pU-12: SA-lox71-IRES-M-loxP-puro-pA-PV-SP

(e) pU-15: lox71-M-loxP-pA-lox2272-PV-lox511

(f) pU-16: lox71-IRES-M-loxP-pA-lox2272-PV-lox511

(g) pU-17: (lox71-integrated SA)-M-loxP-pA-lox2272-PV-lox511

(h) pU-18: (lox71-integrated SA)-IRES-M-loxP-pA-lox2272-PV-lox511

(i) (lox71-integrated SA)-M-loxP-pA-lox2272-promoter-M-lox511-SD

In the above-described vector components, SP represents any sequence; SA represents 
a splice acceptor; SD represents a splice donor; IRES represents an internal ribosomal entry 
site; M represents a marker gene; puro represents puromycin resistance gene; pA represents 
a poly(A) sequence; and PV represents a plasmid vector. 
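
The hyphen-separated notation used for the trap vectors can be read mechanically. The following is a minimal sketch for illustration only; the names ABBREVIATIONS, parse_vector and describe are hypothetical. It splits a vector formula into components, keeps a parenthesized unit such as "(lox71-integrated SA)" together, and expands the abbreviations defined above.

```python
import re

# Abbreviations as defined in the text.
ABBREVIATIONS = {
    "SP": "any sequence",
    "SA": "splice acceptor",
    "SD": "splice donor",
    "IRES": "internal ribosomal entry site",
    "M": "marker gene",
    "puro": "puromycin resistance gene",
    "pA": "poly(A) sequence",
    "PV": "plasmid vector",
}

def parse_vector(notation):
    """Split on hyphens that are not inside parentheses."""
    parts = re.split(r"-(?![^()]*\))", notation)
    return [part.strip("()") for part in parts]

def describe(notation):
    """Expand known abbreviations; lox sites and other names pass through."""
    return [ABBREVIATIONS.get(part, part) for part in parse_vector(notation)]

pu12 = parse_vector("SA-lox71-IRES-M-loxP-puro-pA-PV-SP")
pu17 = parse_vector("(lox71-integrated SA)-M-loxP-pA-lox2272-PV-lox511")
print(pu12)
print(pu17)
print(describe("SA-lox71-IRES-M-loxP-pA-PV-SP"))
```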

When trap vectors are integrated into genomic DNA, a part of the vector is deleted in 
most cases and, as a result, an important part of the vector may be missed. SP is any 
sequence added as a dummy to prevent such deletion. This sequence may be selected at 
one's discretion. The length of SP is 100-1000 bp, preferably 300-400 bp. Any known 
sequence may be used as SP. For example, a part of rabbit β-globin gene may be used.

The splice acceptor means a sequence that can be linked to the 3' end of an exon at the 
time of splicing. 

The splice donor means a sequence that can be linked to the 5' end of an exon at the 
time of splicing. 

IRES, called "internal ribosomal entry site", is a sequence on mRNA at which a ribosome
can bind internally, and it allows translation to start in a cap-independent manner.

The marker gene is a gene that serves as a marker to indicate whether the vector of the
invention could trap a target gene. Specific examples of marker genes include E. coli-
derived β-galactosidase gene (lacZ gene) or a fusion gene between lacZ gene and
neomycin (G418) resistance gene (β-geo gene), CAT gene, GFP gene, SV40 large T gene,
neomycin resistance gene, puromycin resistance gene, hygromycin resistance gene, and
blasticidin resistance gene.

The plasmid vector is used after gene trapping to isolate the endogenous gene by 
plasmid rescue. Plasmid rescue technique is a method for recovering adjacent regions of 
the plasmid (which is replicable in E. coli) integrated in a trap vector using a part of the 
plasmid. For example, when a genomic DNA segment is linked to the plasmid, a fragment 
consisting of the plasmid and the genomic DNA segment linked thereto is excised by 
restriction enzyme treatment. The excised fragment is made circular and introduced into
E. coli, which is then propagated. As a result, the genomic DNA segment flanking the
plasmid can be recovered. Specific examples of the plasmid vector include pBR322, pUC
(pUC18, pUC19, pUC118, pUC119, etc.), pSP (pSP64, pSP65, etc.), and pGEM (pGEM-3,
pGEM-4, pGEM-3Z, etc.). In addition, supF, ampicillin resistance gene, origin of
replication, or restriction sites for cloning (e.g. multicloning site) may be linked to the 
plasmid vector independently or in an appropriate combination. 

The vector shown in (a) above is designated "U8". The basic part of U8
(SA-IRES-β-geo-pA; Fig. 7) is derived from pGT1.8IRESbetageo. This pGT1.8IRESbetageo
contains mouse En-2 gene-derived splice acceptor, IRES and β-geo. lox71 is inserted
into the BglII site of this plasmid, followed by SalI treatment to thereby provide a SalI
fragment. On the other hand, plasmid pEBN-Seti is prepared by inserting into a vector
(such as pUC19) a 180 bp SP sequence, loxP and poly(A) signal. The SalI fragment
obtained above is inserted into the SalI site of this plasmid to produce U8. Thus, the
structure of this trap vector is expressed as follows (from the 5' end, in this order): any
sequence, splice acceptor, lox71, IRES, β-geo, pA, loxP, pUC19, and any sequence (Fig.
7).

The vector shown in (b) above is designated "U8delta". U8delta is obtainable by 
deleting the splice acceptor from U8. This vector has a structure in which lox71 is linked 
before the reporter β-geo and loxP after β-geo. This vector was given such a structure 
because the intermediate IRES and β-geo can be removed completely by transiently 
expressing Cre after the vector has been integrated. As a result, plasmid pUC is located 
close to the mouse endogenous gene which was located upstream of the plasmid. Thus, the 
mouse endogenous gene can be isolated easily. 

The vector shown in (c) above is designated "pU-Hachi". This vector is composed of 
SA-lox71-IRES-M-loxP-pA-PV-SP. pU-Hachi vector is derived from pGT1.8IRESβ-geo, 
and contains SA sequence from mouse En-2 gene and β-geo sequence linked to 
encephalomyocarditis virus-derived IRES sequence. A BamHI fragment of lox71 is 
inserted into the BglII site of pGT1.8IRESβ-geo. Then, a plasmid is constructed by 
inserting an SP sequence, a loxP sequence, and poly A addition signal from mouse 
phosphoglycerate kinase-1 (PGK) into a modified vector from which lacZ sequence has been 
removed. The SP sequence is used to protect the 3' end of the trap vector. pU-Hachi is 
obtainable by inserting a SalI fragment of SA-IRES-lox71-β-geo into the SalI site of the 
above plasmid. 

The vector shown in (d) above is designated "pU-12". This vector is composed of 
SA-lox71-IRES-M-loxP-puro-pA-PV-SP. In order to construct this pU-12 trap vector, first, 
the PGK poly(A) signal of pE3NSE7 is replaced with puromycin resistance gene + PGK 
poly(A) signal. Then, lox511 is inserted into the BglII site downstream thereof. Then, a 
SalI fragment of SA-IRES-lox71-β-geo from pU-Hachi is inserted into the restriction site of 
the resultant plasmid to thereby obtain pU-12. 

The vector shown in (e) above is designated "pU-15". This vector is composed of 
lox71-M-loxP-pA-lox2272-PV-lox511. "lox2272" is a mutant loxP in which the spacer 
sequence (gcatacat) is changed to ggatactt (i.e. the second base "c" has been changed to "g", 
and the seventh base "a" to "t"). "lox511" is a mutant loxP in which the spacer sequence 
(gcatacat) is changed to gtatacat (i.e. the second base "c" has been changed to "t"). Since 
lox511 and lox2272 have mutations in the spacer, they do not cause recombination with other 
loxP sequences such as wild-type loxP or lox71, though two lox511 sequences or two lox2272 
sequences cause recombination with each other. The order of lox2272 and lox511 in the 
vector may be changed; either one may come first. (This also applies to other vectors 
using these mutants.) 
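
The spacer rule stated above can be checked mechanically. The following Python sketch is a simplified model under stated assumptions: the function names are hypothetical, lox71 is given the wild-type spacer because the text groups it with wild-type loxP, and recombination is treated as possible only between sites whose spacers are identical.

```python
WILD_TYPE_SPACER = "gcatacat"

# Spacer sequences as given in the text; lox71 is assumed to keep the
# wild-type spacer, since the text lists it alongside wild-type loxP.
SPACERS = {
    "loxP": "gcatacat",
    "lox71": "gcatacat",
    "lox2272": "ggatactt",
    "lox511": "gtatacat",
}


def spacer_differences(spacer, reference=WILD_TYPE_SPACER):
    """List (1-based position, wild-type base, mutant base) for each mismatch."""
    return [
        (i + 1, ref, alt)
        for i, (ref, alt) in enumerate(zip(reference, spacer))
        if ref != alt
    ]


def can_recombine(site_a, site_b):
    """Simplified rule: two lox sites recombine only if their spacers match."""
    return SPACERS[site_a] == SPACERS[site_b]


for name in ("lox2272", "lox511"):
    print(name, spacer_differences(SPACERS[name]))

for pair in (("lox511", "lox511"), ("lox511", "lox2272"), ("lox2272", "loxP")):
    print(pair, can_recombine(*pair))
```

The listed differences reproduce the base changes named in the text (positions 2 and 7 for lox2272, position 2 for lox511). The model ignores the arm mutations that distinguish lox71 from loxP, which do not affect spacer matching.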

The vector shown in (f) above is designated "pU-16". This vector is composed of 
lox71-IRES-M-loxP-pA-lox2272-PV-lox511, and is obtainable by inserting IRES between 
lox71 and β-geo of pU-15. 

The vector shown in (g) above is designated "pU-17". In this vector, lox71 is 
integrated in a region of SA. This vector may be constructed as follows. Briefly, a 
plasmid is constructed by inserting lox511, loxP, PGK poly(A) signal and lox2272 into, for 
example, pSP plasmid. Then, lox71 is inserted into SA in pU-Hachi, followed by insertion 
of β-geo, in this order. This plasmid is ligated to the plasmid constructed above to 
thereby obtain pU-17. 

The vector shown in (h) is designated "pU-18". Like pU-17, this vector also has 
lox71 integrated in SA. pU-18 is obtainable by inserting IRES between SA and β-geo of 
pU-17. 

The vector shown in (i) is composed of 
(lox71-integrated SA)-M-loxP-pA-lox2272-promoter-M-lox511-SD. This vector is obtainable 
by inserting a promoter and M in this order into pU-17 instead of PV and ligating SD after 
lox511. This vector has a promoter added thereto. This promoter is not particularly 
limited, and any promoter may be used. Examples include bacteria- or yeast-derived 
promoters described later in the section of transformant preparation; RNA polymerase 
promoters such as SP6 RNA polymerase promoter, T7 RNA polymerase promoter and T3 RNA 
polymerase promoter; and mammal-derived promoters such as EF1 (elongation factor 1) 
promoter, PGK (phosphoglycerate kinase) promoter and MC1 (polyoma enhancer/herpes 
simplex thymidine kinase) promoter. 

2. Gene Trapping 

Two-step gene trapping is performed using the vector prepared as described above. 
The first step is conventional gene trapping. "Conventional gene trapping" means to 
introduce the above trap vector into ES cells and to trap an endogenous gene present 
inherently in the ES cells. By these procedures, the endogenous gene in the ES cells is 
disrupted. Using these ES cells, the knockout mice described later can be prepared. After 
isolation of the trapped endogenous gene (Fig. 8; "gene X"), subtle mutations are introduced 
into this gene in E. coli using site-specific mutagenesis or the like (Fig. 8; "gene X'"). For 
the introduction of mutations into gene X, known techniques such as Kunkel method, gapped 
duplex method, etc. and methods based on these techniques may be used. For example, 
mutations are introduced by using a mutagenesis kit utilizing site-specific mutagenesis (e.g. 
Mutant-K or Mutant-G available from Takara Shuzo) or LA PCR in vitro Mutagenesis series 
kit (Takara Shuzo). 

The second step of gene trapping means to introduce into ES cells the mutated 
endogenous gene (gene X') ligated downstream of lox66. By these procedures, 
recombination occurs between the lox71 site of the trap vector introduced in the first step 
and the lox66 site of the vector introduced in the second step. As a result, the modified 
gene can be introduced into ES cells in the form of a cassette composed of 
[(lox71/66)-(gene X')-(loxP)] (Fig. 8). 

According to these procedures, not only modified endogenous genes but also human 
genes may be introduced. Any gene may be introduced. In the present invention, this 
method is designated exchangeable gene trapping. 

3. Screening for Trap Vector-Integrated Clones (ES Cells) 

When a gene trap vector is introduced into ES cells and neomycin resistant clones 
are then selected from the resultant cells, these clones are considered to have the trap 
vector integrated downstream of a mouse endogenous gene. DNA is extracted from these 
clones and analyzed by Southern blotting, to thereby select clones in which a single copy of 
the trap vector is integrated. The inventors have found that this selection method enables 
efficient selection of mouse gene-trapping clones. Therefore, this is used as a 
screening system in the present invention. 

(1) Isolation of Neomycin Resistant Clones 

In the present invention, electroporation, microinjection or the like is used for 
introducing trap vectors into ES cells. For example, 100 μg of trap vector is introduced 
into 3x10^ TT2 ES cells suspended in 0.8 ml of phosphate buffer by electroporation (using a 
BioRad GenePulser at 800 V and 3 μF), and the resultant cells are cultured in the presence 
of G418 (concentration: 200 μg/ml). After 1 week, neomycin resistant clones are 
isolated. 

The gene trap vector is integrated into the ES cell genome at random. Therefore, mere 
introduction of the trap vector into ES cells does not necessarily mean integration into a gene. 
The vector may be integrated into a non-gene region. However, since the trap vector 
contains a drug resistance gene neo (neomycin resistance gene), those cells expressing this 
gene are neomycin (also called G418) resistant. In other words, those cells that survive in 
the presence of neomycin are expressing neomycin resistance gene. The neomycin 
resistance gene in the trap vector is expressed only when integrated downstream of a mouse 
gene which is being expressed in the ES cells. Thus, the expression of this neomycin 
resistance gene means that it has been integrated downstream of a certain gene. 
(2) Selection of ES Clones by Integration Pattern 
DNA is extracted from ES clones by conventional methods, and integration patterns 
are analyzed by Southern blotting or the like. When the Southern blot pattern appears as a 
single band, it can be judged that only one copy of the vector is integrated. Therefore, the 
clones showing that pattern are selected. These procedures are performed in order to select 
those clones in which isolation of mouse endogenous genes by plasmid rescue will be easy. 
Also, those clones which have become neomycin resistant with only one copy of the vector 
are trapping mouse endogenous genes at an extremely high probability. 

4. Establishment of Trap Lines (Transgenic Animals) by Production 
Of Chimeric Animals 

Chimeric animals are produced by standard methods (Fig. 9). The species of 
chimeric animals produced in the present invention is not particularly limited. For example, 
mouse, rat, guinea pig, rabbit, goat, sheep, pig, dog or the like may be enumerated. In the 
present invention, mouse is preferable because of its easy handling and propagation. 

ES cells selected with neomycin are aggregated with animal-derived morulae (i.e. 
aggregates of ES cells and morulae are formed) to prepare chimeric animal embryos (e.g. 
those developed to blastocysts). The resultant embryo is transferred into the uterus of a 
foster female animal that has been brought into a pseudo-pregnant state by mating with a 
sterile male animal. If the animal is mouse, offspring will be born about 17 days after this 
transfer. Chimeric animals are selected from the offspring animals. Although those that 
have a high contribution of chimerism are likely to be germline chimeric animals, this can be 
confirmed by crossing such chimeric animals with normal animals. 

Subsequently, chimeric animals are crossed with normal female animals to obtain F1 
to thereby establish mutant animal lines. The following analysis is conducted only for 
those animals that have been established as trap lines (transgenic animals). Further, 
spermatozoa from F1 and two-cell stage embryos obtained by in vitro fertilization using the 
spermatozoa can be stored frozen by ultra-quick freezing technique. 

(1) Analysis of Expression Patterns 

F1 animals are crossed and then expression patterns in embryos (in the case of mouse, 
9.5-day embryos) and adult animals are analyzed. 

(2) Analysis of Phenotypes 

For each of the established animal lines, phenotypes of heterozygous and homozygous 
animals are analyzed. This analysis is carried out by macroscopic observation, internal 
observation by dissection, microscopic examination of tissue sections from each organ, 
examination of the skeletal system by X-ray photography, examination of behavior and 
memory, and blood examination. 

(3) Isolation and Structural Analysis of the Trapped Gene and 
Preparation of Chromosome Map 
Trapped DNA is isolated from the trap clone, and the nucleotide sequence thereof is 
determined (as described later). Then, homology search is performed using the sequence 
information obtained. Consequently, the sequence of the trapped DNA is classified into 
one of the groups of known genes, ESTs (expressed sequence tags), unknown genes or 
repeats. If the DNA is an EST or unknown gene, a chromosome map can be prepared. 
Chromosome maps may be prepared by fluorescent in situ hybridization (FISH), association 
analysis using microsatellite probes or the like, or analysis of hybrid cells produced by 
irradiation. Once the position of the DNA on the chromosome has been determined, this 
position is compared with the positions of mutant genes in existing mutant mice to examine 
whether the relevant position coincides with one of them. 
(4) Construction of Database 
For each of the established lines, a database is prepared on expression patterns of the 
marker gene in embryos (in the case of mouse, 10-day embryos) and adult animals; 
phenotypes in F1 and F2 animals; the nucleotide sequence of the trapped endogenous DNA; 
and, if the DNA is an EST or unknown gene, its position in the chromosome. 

5. Knockout Animals 

The knockout animal of the invention is an animal that has been treated so that the 
function of a specific gene is lost. The procedures of such treatment will be described 
below. 

The animals that may be used in the present invention include mouse, rat, guinea pig, 
rabbit, goat, sheep, pig and dog. Preferably, mouse is used in the invention because of its 
easy handling and propagation. 

A genomic DNA fragment containing an unknown gene is obtained by PCR from a 
genomic DNA prepared from animal ES cells or obtained from a genomic library. Then, 
this fragment is integrated into the trap vector of the invention. As a result of this operation, 
the function of the exons in the unknown gene is destroyed. In the trap vector, thymidine 
kinase (tk) gene or diphtheria toxin (DT) gene has been ligated in advance for negative 
selection. This trap vector is introduced into ES cells by electroporation or the like. The 
resultant cells are cultured in the presence of neomycin for positive selection and a nucleic 
acid analogue FIAU (fluoroiodoadenosyluracil) or diphtheria toxin for negative selection. 
Through these selections, only the trap vector-integrated ES cells remain. In these ES 
clones, the genes containing disrupted exons are knocked out. The resultant cells are 
transferred into the uterus of foster female animals. Then, chimeric animals are selected 
from offspring animals. By crossing these chimeric animals with normal animals, 
heterozygous animals are obtained. Then, homozygotes can be obtained through crossing 
between the heterozygotes. 

In order to confirm that a knockout mouse is obtained, F1 mice are X-ray 
photographed and examined for bone abnormalities (e.g. changes in shape). Alternatively, 
this confirmation may be made by observing the presence or absence of abnormalities in the 
appearance of mice and by observing abnormalities in various tissues and organs at the time 
of dissection. Also, the confirmation may be made by extracting RNA from tissues and 
analyzing the expression pattern of the relevant gene by Northern blotting. If necessary, 
blood samples may be taken and subjected to blood examination and serum biochemical 
examination. 

6. Isolation of Genes, Construction of Recombinant Vectors and 
Preparation of Transformants 
(1) Isolation of Genes 

In the present invention, genes trapped as described above can be cloned and 
structurally analyzed. 

Isolation of DNA from the trap clone may be performed by conventional techniques. 
For example, if cloning is performed using mRNA from the trap clone, first, total RNA is 
obtained from the trap clone by treating the clone with guanidine reagent, phenol reagent or 
the like. From the total RNA, poly(A+) RNA (mRNA) is obtained by affinity column 
method using, for example, oligo dT-cellulose or poly U-Sepharose containing Sepharose 2B 
as a carrier, or by batch method. Using this mRNA as a template, single-stranded cDNA is 
synthesized using oligo dT primers and a reverse transcriptase. Then, double-stranded cDNA 
is synthesized from the single-stranded cDNA. The thus obtained double-stranded cDNA 
is inserted into an appropriate expression vector (e.g. λgt11) to thereby obtain a cDNA 
library. 

The gene obtained as described above is subjected to sequencing. The sequencing 
may be performed by known techniques such as the chemical modification method of 
Maxam and Gilbert or the dideoxy nucleotide chain termination method using DNA polymerase. 
Usually, the nucleotide sequence of the gene can be determined using an automated 
sequencer. When the 5' region or 3' region of the relevant cDNA is undetermined, the 
entire nucleotide sequence is determined by 5'-RACE or 3'-RACE. RACE (Rapid 
Amplification of cDNA Ends) is a well-known technique in the art (Frohman, M.A. et al., 
Methods Enzymol. Vol. 218, pp. 340-358 (1993)), and kits for performing RACE are 
commercially available (e.g. Marathon™ cDNA Amplification Kit; Clontech). Once the 
nucleotide sequence of the gene of the invention has been determined, the gene can be 
obtained by chemical synthesis or PCR using primers synthesized based on that sequence. 

(2) Construction of Recombinant Vectors 

A gene fragment of interest is purified and ligated to vector DNA. As the vector, any 
vector may be used, such as a phage vector or plasmid vector. The technique to ligate DNA 
of interest to vectors is well known in the art (J. Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, 1989). Further, 
recombinant vectors are prepared from the resultant vector and introduced into E. coli or the 
like. Then, appropriate colonies are selected to prepare desired recombinant vectors. 

(3) Transformants 

The transformant of the invention can be obtained by introducing the recombinant 
vector of the invention into a host in such a manner that the gene of interest can be expressed. 
The host is not particularly limited as long as it can express the DNA of the invention. 
Specific examples of the host include bacteria, yeast, animal cells and insect cells. 

When a bacterium such as E. coli is used as the host, it is preferred that the 
recombinant vector of the invention be capable of autonomous replication in the bacterium 
and yet be composed of a promoter, a ribosome binding sequence, the gene of the invention, 
and a transcription termination sequence. A gene that controls the promoter may also be 
included. Specific examples of E. coli include Escherichia coli K12 and DH1, and specific 
examples of Bacillus include Bacillus subtilis. As the promoter, any promoter may be used 
as long as it can direct the expression of the gene of interest in the host such as E. coli. For 
example, E. coli- or phage-derived promoters such as trp promoter, lac promoter, PL 
promoter and PR promoter may be used. An artificially designed and modified promoter 
such as tac promoter may also be used. As a method for introducing the recombinant 
vector into a host bacterium, any method for introducing DNA into bacteria may be used. 
For example, the method using calcium ions (Cohen, S.N. et al., Proc. Natl. Acad. Sci. USA, 
69:2110-2114 (1972)), electroporation (Becker, D.M. et al., Methods Enzymol., 
194:182-187 (1990)) or the like may be used. 

When yeast is used as the host, Saccharomyces cerevisiae, Schizosaccharomyces pombe 
or the like may be used. As the promoter, any promoter may be used as long as it can direct 
the expression of the gene of interest in yeast. For example, gal1 promoter, gal10 promoter, 
heat shock protein promoter, MFα1 promoter, PHO5 promoter, PGK promoter, GAP 
promoter, ADH promoter and AOX1 promoter may be enumerated. As a method for 
introducing the recombinant vector into yeast, any method for introducing DNA into yeast 
may be used. For example, electroporation, the spheroplast method (Hinnen, A. et al., Proc. 
Natl. Acad. Sci. USA, 75:1929-1933 (1978)), the lithium acetate method (Itoh, H., J. 
Bacteriol., 153:163-168 (1983)) or the like may be used. 

When an animal cell is used as the host, COS cells, Vero cells, Chinese hamster ovary 
cells (CHO cells), mouse myeloma cells or the like may be used. As the promoter, SRα 
promoter, SV40 promoter, LTR promoter, EF1 promoter, PGK promoter, MC1 promoter, or 
the like may be used. Alternatively, human cytomegalovirus early gene promoter or the 
like may be used. As a method for introducing the recombinant vector into an animal cell, 
electroporation, the calcium phosphate method, lipofection, or the like may be used. 

When an insect cell is used as the host, Sf9 cells, Sf21 cells or the like may be used. 
As a method for introducing the recombinant vector into an insect cell, the calcium 
phosphate method, lipofection, electroporation, or the like may be used. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a diagram showing the concept of structural analysis in both gene region and 
non-gene region. 



Fig. 2 is a diagram showing an outline of the construction of a trap vector and the gene 
trapping according to the invention. 

Fig. 3 is a diagram showing the structure of loxP. 

Fig. 4 is a diagram showing recombination between lox71 and lox66. 

Fig. 5 is a diagram showing insertion of a DNA fragment when mutant loxP sequences 
are used. 

Fig. 6 is a diagram showing trap vectors of the invention. 
Fig. 7 is a flow chart showing the construction of a trap vector pU-Hachi. 
Fig. 8 is a diagram showing the two-step exchangeable gene trapping of the invention. 
Fig. 9 is a diagram showing an outline of the establishment of trap lines by production 
of chimeric animals. 

Fig. 10 is a diagram showing various positions of integration of trap vectors. 
Fig. 11 is a photograph showing a bending mutation in the tailbone in mouse. 

BEST MODES FOR CARRYING OUT THE INVENTION 

Hereinbelow, the present invention will be described in more detail with reference to 
the following Examples. It should be noted that the technical scope of the present 
invention is not limited by these Examples. 

EXAMPLE 1 : Construction of Varied-Type Gene Trap Vectors 
(1) Construction of pU-Hachi Trap Vector 

pU-Hachi vector is derived from pGT1.8IRESβ-geo, and contains SA sequence from 
mouse En-2 gene and β-geo sequence linked to encephalomyocarditis virus-derived IRES 
sequence. First, a BamHI fragment of lox71 was inserted into the BglII site of 
pGT1.8IRESβ-geo. Then, plasmid pEBN-SE7ti was constructed by inserting a 180 bp (SP) 
sequence (which is a part of rabbit β-globin gene), loxP sequence, and poly A addition 
signal from mouse phosphoglycerate kinase-1 (PGK) into a modified vector pUC19 from which 
lacZ sequence has been removed. The SP sequence was used to protect the 3' end of the trap 
vector. By inserting a SalI fragment of SA-IRES-lox71-β-geo into the SalI site of 
pEBN-SE7ti, pU-Hachi was obtained. 

(2) Construction of pU-12 Trap Vector 

In order to construct pU-12 trap vector, first, the PGK poly(A) signal of pE3NSE7 was 
replaced with puromycin resistance gene + PGK poly(A) signal. Then, lox511 was inserted 
into the BglII site downstream thereof to prepare a plasmid. By inserting a SalI fragment of 
SA-IRES-lox71-β-geo from pU-Hachi into the SalI site of the above plasmid, pU-12 was 
obtained. 

(3) Construction of pU-17 Trap Vector 

First, lox511, loxP, PGK poly(A) signal and lox2272 were inserted in this order into 
plasmid pSP73 (Promega) to construct pSP5PP2. Subsequently, pU-Hachi was cleaved at 
one of the two BamHI sites within SA located upstream of the other. The DNA fragment of 
pU-Hachi up to the upstream BamHI site in SA, lox71 sequence, the DNA fragment of 
pU-Hachi from the downstream BamHI site in SA to the KpnI site, and a NcoI-SalI fragment 
of β-geo were inserted in this order into pBluescriptII KS+ plasmid to construct 
pKS+S71Aβgeo. From this plasmid, an XhoII fragment of SA-β-geo containing lox71 was 
excised and inserted into the SpeI site of pSP5PP2 to thereby obtain pU-17. 

EXAMPLE 2: Selection of ES Cell Clones 

In the electroporation using pU-Hachi trap vector, 100 μg of SpeI-digested DNA and 
3x10^ cells were used. Cells were suspended in 0.8 ml of PBS and electroporated using a 
BioRad GenePulser at 800 V and 3 μF. After 48 hours, the cells were cultured in the 
presence of 200 μg/ml G418. This selection was maintained for 7 days. The resultant 
colonies were plated on 24-well plates for propagation and stored frozen. Trap clones were 
analyzed by Southern blotting to select those cell strains that exhibit patterns of single copy 
integration. 

In order to remove sequence from the trap clone, pCAGGS-Cre (Araki, K. et al., 
Proc. Natl. Acad. Sci. USA, 92:160-164, 1995; Araki, K. et al., Nucl. Acids Res., 25:868-872, 
1997; Araki, K. et al., J. Biochem. Tokyo, 122:977-982, 1997) was electroporated in a 
circular form. This electroporation was carried out under the same conditions as described 
above except that the number of cells was 1.5x10^ and that the PBS volume was 0.4 ml. 

One half of the thus treated cells were plated onto a 100 mm plate and grown for 48 
hours. Then, the cells were re-plated onto 100 mm plates at 1x10^ cells per plate for colony 
formation. After 1 week, colonies were picked up and expanded for DNA preparation. 

For the co-electroporation experiments designed for targeted integration into the lox71 
site of the trap vector, 20 μg of each targeting plasmid (p66^IEGPPac, p66^INZPPac or 
p66PGKPac-5) and pCAGGS-Cre were used in circular forms. 

Plasmid p66PGKPac-5 was constructed by inserting a lox66 fragment and PGK 
promoter-puromycin resistance gene coding sequence into pSP73 vector (Promega). 
Plasmid p66^IEGPPac was constructed from pSP73 vector (Promega), IRES sequence, 
EGFP gene (Clontech), PGK promoter, Pac gene and lox66 sequence. Plasmid 
p66^INZPPac was constructed by replacing the PGK gene in p66^IEGPPac with a lacZ gene 
fused to SV40 large T gene-derived nuclear localization signal. 

Cells suspended in PBS (1x10^ cells/0.8 ml) were electroporated at 200 V and 950 μF. 
After 48 hours, the cells were subjected to selection with puromycin at 2 μg/ml for 3 
days. Then, the cells were transferred into a normal medium. Nine days after the 
electroporation, colonies were picked up and expanded. 

Embryoid bodies (EBs) were produced according to a known method (Abe, K., Niwa, 
H. et al., Exp. Cell Res. 229: 27-34, 1996). β-galactosidase activity in ES cells and EBs 
was determined by staining with 5-bromo-4-chloro-3-indolyl β-D-galactopyranoside 
(X-gal) as described (Gossler, A. and Zachgo, J., Gene Targeting: A Practical Approach, 
Joyner, A. (ed.), Oxford University Press, Oxford, 1993, pp. 181-227). 

Trap vector pU-Hachi was linearized and introduced into TT2 ES cells. As a result, 
109 clones were isolated. Genomic DNA was prepared from each clone and subjected to 
Southern blotting using a pUC probe and at least 3 restriction enzymes to examine trap 
vector-integration patterns. 

A single band was confirmed in 69% of the clones. Since the presence of the lox71 site 
is essential for the Cre-mediated integration, the presence was confirmed by Southern 
blotting using a lacZ probe and PstI digestion. As a result, it was found that lox71 was 
deleted in 10% of the clones (Table 4). The remaining 59% of the clones in which a single 
copy was integrated and yet the lox71 site was maintained were selected for further analysis. 



Table 4 

Integration pattern | No. of clones (%) 
Total No. of clones tested | 109 (100) 
Single copy integration, clones retaining lox71 site, retaining plasmid replication origin | 24 (22) 
Single copy integration, clones retaining lox71 site, without plasmid replication origin | 40 (37) 
Single copy integration, clones without lox71 site | 11 (10) 
Multi-copy integration, 2-3 copies | 26 (24) 
Multi-copy integration, ≥5 copies | 8 (7) 
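
The percentages quoted in the text (69% single copy, 10% without lox71, 59% selected) follow from the clone counts in Table 4. The following Python sketch recomputes them; the dictionary keys and the helper name are illustrative labels, not terms from the specification, and rounding to the nearest whole percent is assumed.

```python
# Clone counts as listed in Table 4; the keys are shorthand labels.
counts = {
    "single copy, lox71 retained, origin retained": 24,
    "single copy, lox71 retained, origin absent": 40,
    "single copy, lox71 absent": 11,
    "multi-copy, 2-3 copies": 26,
    "multi-copy, highest copy-number class": 8,
}


def percent(n, total):
    """Percentage rounded to the nearest whole number (assumed convention)."""
    return round(100 * n / total)


total = sum(counts.values())
single_copy = sum(v for k, v in counts.items() if k.startswith("single copy"))
selected = sum(v for k, v in counts.items() if "lox71 retained" in k)

print("total clones:", total)
print("single copy (%):", percent(single_copy, total))
print("single copy without lox71 (%):", percent(counts["single copy, lox71 absent"], total))
print("single copy with lox71, selected (%):", percent(selected, total))
```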



In order to evaluate the capture of endogenous genes by the trap vector, cells were 
stained with X-gal before and after the formation of embryoid bodies. As shown in Table 5, 
97% of the tested clones exhibited β-gal activity at a specific stage in differentiation. 
This means that pU-Hachi trap vector performs effective gene trapping comparable to the 
trapping of other IRES vectors. 



Table 5 

Clone No. (%) | Expression of β-geo in undifferentiated ES cells | Expression of β-geo in differentiated EBs (Day 8) 
26 (41) | + | + 
32 (50) | - | + 
4 (6) | + | - 
2 (3) | - | - 







EXAMPLE 3: Selection Frequency of Clones 

In order to select those clones in which a single copy of the trap vector was integrated, 
DNA was extracted from the selected, neomycin resistant clones and analyzed by Southern 
blotting. 

Briefly, cells were lysed with SDS/proteinase K, treated with phenol/chloroform (1:1, 
vol: vol) twice, precipitated with ethanol, and then dissolved in TE buffer (10 mM Tris-HCl, 
pH 7.5/1 mM EDTA). Six micrograms of genomic DNA was digested with appropriate 
restriction enzymes, electrophoresed on 0.9% agarose gel and then blotted onto a nylon 
membrane (Boehringer Mannheim). Hybridization was performed using a DIG DNA 
Labeling and Detection Kit (Boehringer Mannheim). 

For PCR analysis, DNA was subjected to 28 cycles of denaturation at 94°C for 1 min, 
annealing at 55°C for 2 min and extension at 72°C for 2 min in the reaction solution 
described below. 

The primers used in the PCR were as follows: 

β-geo detection primers 
Z1 (forward): 5'-gcgttacccaacttaatcg-3' (SEQ ID NO: 7) 
Z2 (reverse): 5'-tgtgagcgagtaacaacc-3' (SEQ ID NO: 8) 

Primers for detecting the replication origin region in pUC vector 
Ori2 (forward): 5'-gccagtggcgataagtcgtgtc-3' (SEQ ID NO: 9) 
Ori3 (reverse): 5'-cacagaatcaggggataacgc-3' (SEQ ID NO: 10) 

Reaction Solution 

10 X PCR buffer | 2 μl 
10 mM dNTP | 0.2 μl 
Forward primer (100 pmol/μl) | 0.2 μl 
Reverse primer (100 pmol/μl) | 0.2 μl 
AmpliTaq DNA polymerase (Perkin Elmer) | 0.2 μl 
Total volume (adjusted with sterilized distilled water) | 20 μl 

One half of the resultant PCR product was loaded onto agarose gel and analyzed. 
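
As a rough plausibility check on the primers listed above against the 55°C annealing step, their length, GC content and an approximate melting temperature can be computed. The following Python sketch uses the Wallace rule of thumb, Tm = 2(A+T) + 4(G+C), which is a generic approximation for short oligonucleotides and is not stated in this specification; the function names are hypothetical, and the sequences are keyed by SEQ ID NO as transcribed from the text.

```python
# Primer sequences transcribed from the text, keyed by SEQ ID NO.
PRIMERS = {
    7: "gcgttacccaacttaatcg",
    8: "tgtgagcgagtaacaacc",
    9: "gccagtggcgataagtcgtgtc",
    10: "cacagaatcaggggataacgc",
}


def gc_count(seq):
    """Number of G and C bases in the sequence (case-insensitive)."""
    s = seq.lower()
    return s.count("g") + s.count("c")


def wallace_tm(seq):
    """Approximate Tm in degrees C by the Wallace rule: 2(A+T) + 4(G+C)."""
    gc = gc_count(seq)
    at = len(seq) - gc
    return 2 * at + 4 * gc


for seq_id, seq in PRIMERS.items():
    gc_percent = 100 * gc_count(seq) / len(seq)
    print(f"SEQ ID NO: {seq_id}  length={len(seq)}  "
          f"GC={gc_percent:.0f}%  Tm~{wallace_tm(seq)} C")
```

The Wallace rule ignores salt and primer concentration, so the values are only a coarse guide and do not replace the conditions given in the text.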

Plasmid rescue (i.e. recovery of the trapped gene) was performed as described below. 

Briefly, genomic DNA (20 μg) was digested with appropriate restriction enzymes and 
ligated in a reaction volume of 400 μl to obtain circular molecules. After 
phenol/chloroform extraction and ethanol precipitation, the DNA was suspended in 10 μl 
of TE. Using one half of this DNA suspension, E. coli (STBL2; Life Technologies) was 
transformed by electroporation. The electroporation was performed according to the 
manual of BioRad GenePulser. The electroporated cells were incubated in 1 ml of Circle 
Grow medium (BIO 101) at 30°C for 1 hour under agitation. Then, after concentration, the 
sample was plated on LB/agar plates followed by selection of plasmids with ampicillin. 
The rescued plasmids were analyzed by restriction mapping and sequencing. Nucleotide 
sequences were determined with Thermo Sequenase Fluorescent-Labeled Primer Cycle 
Sequencing Kit (Amersham). 

As a result, as shown in Table 6, clones in which recombination occurred at a high 
frequency could be obtained. 



Table 6 

Trap clone | No. of subclones analyzed | No. of subclones in which recombination occurred | Recombination frequency (%) | Length of the 5' flanking region obtained by plasmid rescue (kb) | Length of the 3' flanking region obtained by plasmid rescue (kb) 
Ayu8-003 | 23 | 15 | 65 | 75 | 53 
Ayu8-016 | 20 | 2 | 10 | 3.8 | 4.5 
Ayu8-025 | 23 | 16 | 70 | 1.8 | 6.5 
Ayu8-104 | 12 | 5 | 42 | 3.5 | 7 
Ayu8-108 | 12 | 6 | 50 | 5 | 6 
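
The recombination frequencies in Table 6 are the ratio of recombined subclones to subclones analyzed. The following Python sketch recomputes them from the tabulated counts; the variable and function names are illustrative, and rounding to the nearest whole percent is assumed.

```python
# (trap clone, subclones analyzed, subclones recombined, frequency reported in Table 6)
rows = [
    ("Ayu8-003", 23, 15, 65),
    ("Ayu8-016", 20, 2, 10),
    ("Ayu8-025", 23, 16, 70),
    ("Ayu8-104", 12, 5, 42),
    ("Ayu8-108", 12, 6, 50),
]


def frequency(recombined, analyzed):
    """Recombination frequency in percent, rounded to a whole number."""
    return round(100 * recombined / analyzed)


for clone, analyzed, recombined, reported in rows:
    computed = frequency(recombined, analyzed)
    status = "matches" if computed == reported else "differs from"
    print(f"{clone}: {recombined}/{analyzed} = {computed}% ({status} reported {reported}%)")
```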



EXAMPLE 4: Production of Chimeric Mice and Gene Analysis 

(1) Introduction of the Clone into Mice 

The trap ES clone was aggregated with ICR mouse-derived 8-cell stage embryos and 
cultured overnight. On the next day, aggregates of the ES cell and embryo that had 
developed to blastocysts were selected. Approximately 20 of these chimeric embryos were 
transferred into the uterus of a foster female mouse that was pre-mated with a sterile male 
mouse. Offspring mice were born about 17 days thereafter. Eight weeks after birth when 
they became sexually mature, these chimeric mice were crossed with normal female mice to 
obtain ES clone-derived F1 mice. 

(2) Analysis of Phenotypes 

The F1 mice were X-ray photographed, and the presence or absence of abnormalities 
in the bone was observed. 

(3) Analysis of the Trapped Gene 

Since the trapped gene should form a fusion mRNA with β-geo, the trapped 
gene was identified on the basis of this fusion transcript. 

Briefly, mRNA was extracted from X-gal staining-positive tissues of F1 mice. From 
the resultant mRNA, single-stranded cDNA was synthesized with a Thermoscript RT-PCR 
system (GIBCO BRL) using sequences within the SA as primers. Subsequently, a cDNA 
fragment corresponding to the upstream region of the trapped gene that was linked to the 
exon of the SA in the vector was obtained using a 5' RACE system (GIBCO BRL). The 
resultant cDNA fragment was cloned into a plasmid vector and subjected to sequencing. 

(4) Results 

Table 7 shows one example of the results obtained from the analysis of trapped 
genes. 



Table 7 

No.  Clone No.   Gene                                  Phenotype
1    Ayu8-R38    Sp1
2    Ayu8-029    PCM1 (pericentriolar material 1)
3    Ayu3-008    Cyclin B2
4    Ayu6-003    Homologue to the E. coli Ftsj1 gene
5    Ayu8-003    dynamin II                            Death at embryonic stage
6    Ayu8-R16    sui1
7    Ayu8-016    Upstream region of hnRNP L
8    Ayu8-019    RNA polymerase I
9    Ayu8-108    importin β
10   Ayu8-021    Unknown                               Kinky tail



Among the genes obtained as described above, the PCM1 gene was analyzed. As a 
result, the sequences shown in SEQ ID NOS: 11 to 13 (5' RACE partial fragments) were 
obtained. These sequences matched a part of the known PCM1 gene. Further, the 
mice obtained from the Ayu8-021 clone exhibited a mutation causing bending of the tail bone (kinky 
tail) (Fig. 11). This mutant gene fragment was sequenced to obtain the sequence 
shown in SEQ ID NO: 14. 



All of the publications, patents and patent applications referred to in the present 
specification are incorporated herein by reference in their entirety. 

INDUSTRIAL APPLICABILITY 

The present invention provides gene trap vectors and a method of gene trapping. 
According to the present invention, first, (1) knockout mice can be produced efficiently. In 
most cases, mouse genes are knocked out by the integration of the trap vector into the genes. 
Therefore, if trap vector-introduced ES cells are used, mice in which a mouse gene is knocked 
out can be produced. That is, knockout mice can be produced efficiently by selecting 
neomycin-resistant clones and then selecting those clones in which a single copy of the trap 
vector is integrated. With conventional homologous recombination methods, one 
researcher can produce only 4 lines of knockout mice in one year at best. 
According to the method of the invention, however, one researcher can establish as many as 
240 lines in one year if, for example, he/she establishes 6 lines per week and works 40 weeks 
a year. Thus, the method of the invention is 60 times more efficient than conventional 
methods. 
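The throughput comparison above is simple arithmetic and can be reproduced as follows. This is only an illustrative sketch using the figures stated in the text (6 lines per week, 40 working weeks, 4 lines per year by conventional methods). The function names are hypothetical.

```python
def lines_per_year(lines_per_week, weeks_per_year):
    """Number of mouse lines one researcher establishes in a year."""
    return lines_per_week * weeks_per_year


def fold_improvement(new_lines_per_year, conventional_lines_per_year):
    """Ratio of yearly throughput: new method over conventional method."""
    if conventional_lines_per_year <= 0:
        raise ValueError("conventional throughput must be positive")
    return new_lines_per_year / conventional_lines_per_year


# Figures taken from the text above.
trap_lines = lines_per_year(lines_per_week=6, weeks_per_year=40)
conventional_lines = 4

if __name__ == "__main__":
    print(trap_lines)                                        # 6 * 40 = 240
    print(fold_improvement(trap_lines, conventional_lines))  # 240 / 4 = 60.0
```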

(2) The method of the invention allows detailed analysis of gene functions. 

In the gene trapping method of the invention, mutations can be introduced in 
advance into each part of a gene that appears to have a certain function, and the 
resultant mutant genes can be integrated into trap vectors. The mutant gene-integrated trap vector can 
then be introduced into mice, followed by analysis of the phenotype. 

(3) The method of the invention allows the production of disease model mice which 
are closer to humans. 

According to the present invention, it is possible to create disease model mice which 
are closer to humans than conventional models, because a human gene having the same 
mutation as found in a human disease can be introduced into mice in place of the 
corresponding mouse gene. 



SEQUENCE LISTING FREE TEXT 



SEQ ID NO: 1: synthetic DNA 
SEQ ID NO: 2: synthetic DNA 
SEQ ID NO: 3: synthetic DNA 
SEQ ID NO: 4: synthetic DNA 
SEQ ID NO: 5: synthetic DNA 
SEQ ID NO: 6: homologous recombination sequence 
SEQ ID NO: 7: synthetic DNA 
SEQ ID NO: 8: synthetic DNA 
SEQ ID NO: 9: synthetic DNA 
SEQ ID NO: 10: synthetic DNA 